Overview
Dataset statistics
| Number of variables | 14 |
|---|---|
| Number of observations | 10000 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 1.1 MiB |
| Average record size in memory | 112.0 B |
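The two memory figures above agree with each other under one simple assumption: every column takes 8 bytes per row (a 64-bit number or an object pointer). The report does not state how the data is stored, so the 8-byte width is an assumption. A minimal sketch of the arithmetic:

```python
# Figures taken from the dataset statistics table above.
n_variables = 14
n_observations = 10_000

# Assumption (not stated in the report): each column uses 8 bytes per row,
# e.g. int64, float64, or an object pointer.
BYTES_PER_VALUE = 8

record_bytes = n_variables * BYTES_PER_VALUE   # bytes per record
total_bytes = record_bytes * n_observations    # bytes for the whole table
total_mib = total_bytes / 2**20                # 1 MiB = 2**20 bytes

print(f"{record_bytes} B per record, {total_mib:.1f} MiB in total")
```

Under that assumption the result matches the reported 112.0 B per record and 1.1 MiB in total.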
Variable types
| Numeric | 7 |
|---|---|
| Text | 1 |
| Categorical | 6 |
Alerts
| RowNumber is uniformly distributed | Uniform |
|---|---|
| RowNumber has unique values | Unique |
| CustomerId has unique values | Unique |
| Tenure has 413 (4.1%) zeros | Zeros |
| Balance has 3617 (36.2%) zeros | Zeros |
Reproduction
| Analysis started | 2025-12-25 18:09:08.707568 |
|---|---|
| Analysis finished | 2025-12-25 18:09:24.730288 |
| Duration | 16.02 seconds |
| Software version | ydata-profiling v4.18.0 |
| Download configuration | config.json |
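The reported duration can be checked from the two timestamps. A small sketch using only the standard library, with the timestamps copied from the table above:

```python
from datetime import datetime

# Timestamps copied from the Reproduction table.
started = datetime.fromisoformat("2025-12-25 18:09:08.707568")
finished = datetime.fromisoformat("2025-12-25 18:09:24.730288")

# Elapsed wall-clock time of the analysis, in seconds.
duration_seconds = (finished - started).total_seconds()

print(f"{duration_seconds:.2f} seconds")
```

The difference rounds to the 16.02 seconds shown in the Duration row.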
Variables
RowNumber
Real number (ℝ)
Uniform Unique
| Distinct | 10000 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5000.5 |
| Minimum | 1 |
|---|---|
| Maximum | 10000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.3 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 500.95 |
| Q1 | 2500.75 |
| median | 5000.5 |
| Q3 | 7500.25 |
| 95-th percentile | 9500.05 |
| Maximum | 10000 |
| Range | 9999 |
| Interquartile range (IQR) | 4999.5 |
Descriptive statistics
| Standard deviation | 2886.8957 |
|---|---|
| Coefficient of variation (CV) | 0.5773214 |
| Kurtosis | -1.2 |
| Mean | 5000.5 |
| Median Absolute Deviation (MAD) | 2500 |
| Skewness | 0 |
| Sum | 50005000 |
| Variance | 8334166.7 |
| Monotonicity | Strictly increasing |
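The RowNumber statistics are what one would expect from the integers 1 to 10000 taken once each. The sketch below recomputes several of them with the standard library. It assumes RowNumber is exactly that sequence (suggested by the minimum, maximum, and distinct count, but not stated outright) and that percentiles use linear interpolation between neighbouring ranks. The `percentile` helper is written for this illustration only.

```python
import statistics


def percentile(sorted_values, q):
    """Percentile by linear interpolation between the two nearest ranks."""
    position = (len(sorted_values) - 1) * q
    lower = int(position)
    upper = min(lower + 1, len(sorted_values) - 1)
    step = sorted_values[upper] - sorted_values[lower]
    return sorted_values[lower] + step * (position - lower)


# Assumption: RowNumber is exactly 1, 2, ..., 10000.
row_numbers = list(range(1, 10_001))

mean = statistics.mean(row_numbers)
variance = statistics.variance(row_numbers)   # sample variance (n - 1 denominator)
std = variance ** 0.5
cv = std / mean                               # coefficient of variation
median = statistics.median(row_numbers)
mad = statistics.median(abs(x - median) for x in row_numbers)

q05 = percentile(row_numbers, 0.05)
q1 = percentile(row_numbers, 0.25)
q3 = percentile(row_numbers, 0.75)
iqr = q3 - q1

print(mean, round(variance, 1), round(std, 4), round(cv, 7), mad)
print(round(q05, 2), q1, q3, iqr)
```

The variance only matches the reported 8334166.7 with the n - 1 denominator, which suggests the report uses the sample variance.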
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 1 | 1 | < 0.1% |
| 2 | 1 | < 0.1% |
| 3 | 1 | < 0.1% |
| 4 | 1 | < 0.1% |
| 5 | 1 | < 0.1% |
| 6 | 1 | < 0.1% |
| 7 | 1 | < 0.1% |
| 8 | 1 | < 0.1% |
| 9 | 1 | < 0.1% |
| 10 | 1 | < 0.1% |
| Other values (9990) | 9990 |
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 2 | 1 | |
| 3 | 1 | |
| 4 | 1 | |
| 5 | 1 | |
| 6 | 1 | |
| 7 | 1 | |
| 8 | 1 | |
| 9 | 1 | |
| 10 | 1 |
| Value | Count | Frequency (%) |
| 10000 | 1 | |
| 9999 | 1 | |
| 9998 | 1 | |
| 9997 | 1 | |
| 9996 | 1 | |
| 9995 | 1 | |
| 9994 | 1 | |
| 9993 | 1 | |
| 9992 | 1 | |
| 9991 | 1 |
CustomerId
Real number (ℝ)
Unique
| Distinct | 10000 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15690941 |
| Minimum | 15565701 |
|---|---|
| Maximum | 15815690 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.3 KiB |
Quantile statistics
| Minimum | 15565701 |
|---|---|
| 5-th percentile | 15578824 |
| Q1 | 15628528 |
| median | 15690738 |
| Q3 | 15753234 |
| 95-th percentile | 15803034 |
| Maximum | 15815690 |
| Range | 249989 |
| Interquartile range (IQR) | 124705.5 |
Descriptive statistics
| Standard deviation | 71936.186 |
|---|---|
| Coefficient of variation (CV) | 0.0045845681 |
| Kurtosis | -1.1961125 |
| Mean | 15690941 |
| Median Absolute Deviation (MAD) | 62432.5 |
| Skewness | 0.0011491459 |
| Sum | 1.5690941 × 10¹¹ |
| Variance | 5.1748149 × 10⁹ |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 15634602 | 1 | < 0.1% |
| 15647311 | 1 | < 0.1% |
| 15619304 | 1 | < 0.1% |
| 15701354 | 1 | < 0.1% |
| 15737888 | 1 | < 0.1% |
| 15574012 | 1 | < 0.1% |
| 15592531 | 1 | < 0.1% |
| 15656148 | 1 | < 0.1% |
| 15792365 | 1 | < 0.1% |
| 15592389 | 1 | < 0.1% |
| Other values (9990) | 9990 |
| Value | Count | Frequency (%) |
| 15565701 | 1 | |
| 15565706 | 1 | |
| 15565714 | 1 | |
| 15565779 | 1 | |
| 15565796 | 1 | |
| 15565806 | 1 | |
| 15565878 | 1 | |
| 15565879 | 1 | |
| 15565891 | 1 | |
| 15565996 | 1 |
| Value | Count | Frequency (%) |
| 15815690 | 1 | |
| 15815660 | 1 | |
| 15815656 | 1 | |
| 15815645 | 1 | |
| 15815628 | 1 | |
| 15815626 | 1 | |
| 15815615 | 1 | |
| 15815560 | 1 | |
| 15815552 | 1 | |
| 15815534 | 1 |
Surname
Text
| Distinct | 2932 |
|---|---|
| Distinct (%) | 29.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 78.3 KiB |
Length
| Max length | 23 |
|---|---|
| Median length | 16 |
| Mean length | 6.4349 |
| Min length | 2 |
Unique
| Unique | 1558 |
|---|---|
| Unique (%) | 15.6% |
Sample
| 1st row | Hargrave |
|---|---|
| 2nd row | Hill |
| 3rd row | Onio |
| 4th row | Boni |
| 5th row | Mitchell |
| Value | Count | Frequency (%) |
| lo | 33 | 0.3% |
| smith | 32 | 0.3% |
| scott | 29 | 0.3% |
| martin | 29 | 0.3% |
| walker | 28 | 0.3% |
| brown | 26 | 0.3% |
| genovese | 25 | 0.2% |
| shih | 25 | 0.2% |
| yeh | 25 | 0.2% |
| wright | 24 | 0.2% |
| Other values (2931) | 9779 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 5799 | 9.0% |
| e | 5764 | 9.0% |
| n | 5235 | 8.1% |
| o | 4905 | 7.6% |
| i | 4491 | 7.0% |
| r | 3547 | 5.5% |
| l | 2921 | 4.5% |
| s | 2592 | 4.0% |
| u | 2552 | 4.0% |
| h | 2150 | 3.3% |
| Other values (45) | 24393 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 64349 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| a | 5799 | 9.0% |
| e | 5764 | 9.0% |
| n | 5235 | 8.1% |
| o | 4905 | 7.6% |
| i | 4491 | 7.0% |
| r | 3547 | 5.5% |
| l | 2921 | 4.5% |
| s | 2592 | 4.0% |
| u | 2552 | 4.0% |
| h | 2150 | 3.3% |
| Other values (45) | 24393 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 64349 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| a | 5799 | 9.0% |
| e | 5764 | 9.0% |
| n | 5235 | 8.1% |
| o | 4905 | 7.6% |
| i | 4491 | 7.0% |
| r | 3547 | 5.5% |
| l | 2921 | 4.5% |
| s | 2592 | 4.0% |
| u | 2552 | 4.0% |
| h | 2150 | 3.3% |
| Other values (45) | 24393 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 64349 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| a | 5799 | 9.0% |
| e | 5764 | 9.0% |
| n | 5235 | 8.1% |
| o | 4905 | 7.6% |
| i | 4491 | 7.0% |
| r | 3547 | 5.5% |
| l | 2921 | 4.5% |
| s | 2592 | 4.0% |
| u | 2552 | 4.0% |
| h | 2150 | 3.3% |
| Other values (45) | 24393 |
CreditScore
Real number (ℝ)
| Distinct | 460 |
|---|---|
| Distinct (%) | 4.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 650.5288 |
| Minimum | 350 |
|---|---|
| Maximum | 850 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.3 KiB |
Quantile statistics
| Minimum | 350 |
|---|---|
| 5-th percentile | 489 |
| Q1 | 584 |
| median | 652 |
| Q3 | 718 |
| 95-th percentile | 812 |
| Maximum | 850 |
| Range | 500 |
| Interquartile range (IQR) | 134 |
Descriptive statistics
| Standard deviation | 96.653299 |
|---|---|
| Coefficient of variation (CV) | 0.14857651 |
| Kurtosis | -0.42572568 |
| Mean | 650.5288 |
| Median Absolute Deviation (MAD) | 67 |
| Skewness | -0.071606608 |
| Sum | 6505288 |
| Variance | 9341.8602 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 850 | 233 | 2.3% |
| 678 | 63 | 0.6% |
| 655 | 54 | 0.5% |
| 705 | 53 | 0.5% |
| 667 | 53 | 0.5% |
| 684 | 52 | 0.5% |
| 651 | 50 | 0.5% |
| 670 | 50 | 0.5% |
| 683 | 48 | 0.5% |
| 652 | 48 | 0.5% |
| Other values (450) | 9296 |
| Value | Count | Frequency (%) |
| 350 | 5 | |
| 351 | 1 | < 0.1% |
| 358 | 1 | < 0.1% |
| 359 | 1 | < 0.1% |
| 363 | 1 | < 0.1% |
| 365 | 1 | < 0.1% |
| 367 | 1 | < 0.1% |
| 373 | 1 | < 0.1% |
| 376 | 2 | < 0.1% |
| 382 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 850 | 233 | |
| 849 | 8 | 0.1% |
| 848 | 5 | 0.1% |
| 847 | 6 | 0.1% |
| 846 | 5 | 0.1% |
| 845 | 6 | 0.1% |
| 844 | 7 | 0.1% |
| 843 | 2 | < 0.1% |
| 842 | 7 | 0.1% |
| 841 | 12 | 0.1% |
Geography
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 78.3 KiB |
Length
| Max length | 7 |
|---|---|
| Median length | 6 |
| Mean length | 6.0032 |
| Min length | 5 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | France |
|---|---|
| 2nd row | Spain |
| 3rd row | France |
| 4th row | France |
| 5th row | Spain |
Common Values
| Value | Count | Frequency (%) |
| France | 5014 | |
| Germany | 2509 | |
| Spain | 2477 |
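Several Frequency (%) cells in the tables of this report came through empty. They can be recovered from the counts, since each frequency is the count divided by the number of observations. A minimal sketch for the Geography table above:

```python
# Counts copied from the Geography "Common Values" table.
counts = {"France": 5014, "Germany": 2509, "Spain": 2477}

total = sum(counts.values())  # should equal the 10000 observations

# Frequency of each category as a percentage of all rows.
frequencies = {value: 100 * count / total for value, count in counts.items()}

for value, pct in frequencies.items():
    print(f"{value}: {pct:.1f}%")
```

The same calculation applies to any other categorical table in the report where the percentage column is blank.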
Length
Histogram of lengths of the category
Common Values (Plot)
| Value | Count | Frequency (%) |
| france | 5014 | |
| germany | 2509 | |
| spain | 2477 |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 10000 | |
| a | 10000 | |
| r | 7523 | |
| e | 7523 | |
| F | 5014 | |
| c | 5014 | |
| G | 2509 | 4.2% |
| m | 2509 | 4.2% |
| y | 2509 | 4.2% |
| S | 2477 | 4.1% |
| Other values (2) | 4954 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 60032 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| n | 10000 | |
| a | 10000 | |
| r | 7523 | |
| e | 7523 | |
| F | 5014 | |
| c | 5014 | |
| G | 2509 | 4.2% |
| m | 2509 | 4.2% |
| y | 2509 | 4.2% |
| S | 2477 | 4.1% |
| Other values (2) | 4954 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 60032 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| n | 10000 | |
| a | 10000 | |
| r | 7523 | |
| e | 7523 | |
| F | 5014 | |
| c | 5014 | |
| G | 2509 | 4.2% |
| m | 2509 | 4.2% |
| y | 2509 | 4.2% |
| S | 2477 | 4.1% |
| Other values (2) | 4954 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 60032 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| n | 10000 | |
| a | 10000 | |
| r | 7523 | |
| e | 7523 | |
| F | 5014 | |
| c | 5014 | |
| G | 2509 | 4.2% |
| m | 2509 | 4.2% |
| y | 2509 | 4.2% |
| S | 2477 | 4.1% |
| Other values (2) | 4954 |
Gender
Categorical
Length
| Max length | 6 |
|---|---|
| Median length | 4 |
| Mean length | 4.9086 |
| Min length | 4 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Female |
|---|---|
| 2nd row | Female |
| 3rd row | Female |
| 4th row | Female |
| 5th row | Female |
Common Values
| Value | Count | Frequency (%) |
| Male | 5457 | |
| Female | 4543 |
Length
Histogram of lengths of the category
Common Values (Plot)
| Value | Count | Frequency (%) |
| male | 5457 | |
| female | 4543 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 14543 | |
| a | 10000 | |
| l | 10000 | |
| M | 5457 | 11.1% |
| F | 4543 | 9.3% |
| m | 4543 | 9.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 49086 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 14543 | |
| a | 10000 | |
| l | 10000 | |
| M | 5457 | 11.1% |
| F | 4543 | 9.3% |
| m | 4543 | 9.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 49086 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 14543 | |
| a | 10000 | |
| l | 10000 | |
| M | 5457 | 11.1% |
| F | 4543 | 9.3% |
| m | 4543 | 9.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 49086 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 14543 | |
| a | 10000 | |
| l | 10000 | |
| M | 5457 | 11.1% |
| F | 4543 | 9.3% |
| m | 4543 | 9.3% |
Age
Real number (ℝ)
| Distinct | 70 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 38.9218 |
| Minimum | 18 |
|---|---|
| Maximum | 92 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.3 KiB |
Quantile statistics
| Minimum | 18 |
|---|---|
| 5-th percentile | 25 |
| Q1 | 32 |
| median | 37 |
| Q3 | 44 |
| 95-th percentile | 60 |
| Maximum | 92 |
| Range | 74 |
| Interquartile range (IQR) | 12 |
Descriptive statistics
| Standard deviation | 10.487806 |
|---|---|
| Coefficient of variation (CV) | 0.26945841 |
| Kurtosis | 1.3953471 |
| Mean | 38.9218 |
| Median Absolute Deviation (MAD) | 6 |
| Skewness | 1.0113203 |
| Sum | 389218 |
| Variance | 109.99408 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 37 | 478 | 4.8% |
| 38 | 477 | 4.8% |
| 35 | 474 | 4.7% |
| 36 | 456 | 4.6% |
| 34 | 447 | 4.5% |
| 33 | 442 | 4.4% |
| 40 | 432 | 4.3% |
| 39 | 423 | 4.2% |
| 32 | 418 | 4.2% |
| 31 | 404 | 4.0% |
| Other values (60) | 5549 |
| Value | Count | Frequency (%) |
| 18 | 22 | 0.2% |
| 19 | 27 | 0.3% |
| 20 | 40 | 0.4% |
| 21 | 53 | 0.5% |
| 22 | 84 | |
| 23 | 99 | |
| 24 | 132 | |
| 25 | 154 | |
| 26 | 200 | |
| 27 | 209 |
| Value | Count | Frequency (%) |
| 92 | 2 | < 0.1% |
| 88 | 1 | < 0.1% |
| 85 | 1 | < 0.1% |
| 84 | 2 | < 0.1% |
| 83 | 1 | < 0.1% |
| 82 | 1 | < 0.1% |
| 81 | 4 | |
| 80 | 3 | |
| 79 | 4 | |
| 78 | 5 |
Tenure
Real number (ℝ)
Zeros
| Distinct | 11 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.0128 |
| Minimum | 0 |
|---|---|
| Maximum | 10 |
| Zeros | 413 |
| Zeros (%) | 4.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 3 |
| median | 5 |
| Q3 | 7 |
| 95-th percentile | 9 |
| Maximum | 10 |
| Range | 10 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 2.8921744 |
|---|---|
| Coefficient of variation (CV) | 0.57695786 |
| Kurtosis | -1.1652252 |
| Mean | 5.0128 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 0.010991458 |
| Sum | 50128 |
| Variance | 8.3646726 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=11)
| Value | Count | Frequency (%) |
| 2 | 1048 | |
| 1 | 1035 | |
| 7 | 1028 | |
| 8 | 1025 | |
| 5 | 1012 | |
| 3 | 1009 | |
| 4 | 989 | |
| 9 | 984 | |
| 6 | 967 | |
| 10 | 490 |
| Value | Count | Frequency (%) |
| 0 | 413 | 4.1% |
| 1 | 1035 | |
| 2 | 1048 | |
| 3 | 1009 | |
| 4 | 989 | |
| 5 | 1012 | |
| 6 | 967 | |
| 7 | 1028 | |
| 8 | 1025 | |
| 9 | 984 |
| Value | Count | Frequency (%) |
| 10 | 490 | |
| 9 | 984 | |
| 8 | 1025 | |
| 7 | 1028 | |
| 6 | 967 | |
| 5 | 1012 | |
| 4 | 989 | |
| 3 | 1009 | |
| 2 | 1048 | |
| 1 | 1035 |
Balance
Real number (ℝ)
Zeros
| Distinct | 6382 |
|---|---|
| Distinct (%) | 63.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 76485.889 |
| Minimum | 0 |
|---|---|
| Maximum | 250898.09 |
| Zeros | 3617 |
| Zeros (%) | 36.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 97198.54 |
| Q3 | 127644.24 |
| 95-th percentile | 162711.67 |
| Maximum | 250898.09 |
| Range | 250898.09 |
| Interquartile range (IQR) | 127644.24 |
Descriptive statistics
| Standard deviation | 62397.405 |
|---|---|
| Coefficient of variation (CV) | 0.81580283 |
| Kurtosis | -1.4894118 |
| Mean | 76485.889 |
| Median Absolute Deviation (MAD) | 46766.79 |
| Skewness | -0.14110871 |
| Sum | 7.6485889 × 10⁸ |
| Variance | 3.8934362 × 10⁹ |
| Monotonicity | Not monotonic |
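The large share of zero balances shapes most of the Balance statistics: because more than a quarter of the values are zero, the 5-th percentile and Q1 are both 0, so the IQR equals Q3. The sketch below also estimates the mean balance among customers with a non-zero balance. That figure is derived from the rounded numbers in the tables above, so it is approximate and does not appear in the report itself.

```python
# Figures copied from the Balance tables above.
n_observations = 10_000
n_zero = 3_617               # "Zeros" row
mean_balance = 76_485.889    # "Mean" row (rounded in the report)
q3 = 127_644.24              # "Q3" row

zero_share = n_zero / n_observations

# With over 25% zeros, the first quartile must be 0, so IQR = Q3 - 0.
q1 = 0.0 if zero_share > 0.25 else None
iqr = q3 - q1 if q1 is not None else None

# Zeros add nothing to the sum, so the total is carried by non-zero balances.
n_nonzero = n_observations - n_zero
total_balance = mean_balance * n_observations
mean_nonzero = total_balance / n_nonzero

print(f"zero share: {zero_share:.1%}, IQR: {iqr}, mean of non-zero balances: {mean_nonzero:.2f}")
```

The non-zero mean of roughly 119,800 is well above the overall mean, which is worth keeping in mind when reading the overall mean and median.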
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0 | 3617 | |
| 130170.82 | 2 | < 0.1% |
| 105473.74 | 2 | < 0.1% |
| 113957.01 | 1 | < 0.1% |
| 85311.7 | 1 | < 0.1% |
| 130142.79 | 1 | < 0.1% |
| 83807.86 | 1 | < 0.1% |
| 159660.8 | 1 | < 0.1% |
| 125510.82 | 1 | < 0.1% |
| 113755.78 | 1 | < 0.1% |
| Other values (6372) | 6372 |
| Value | Count | Frequency (%) |
| 0 | 3617 | |
| 3768.69 | 1 | < 0.1% |
| 12459.19 | 1 | < 0.1% |
| 14262.8 | 1 | < 0.1% |
| 16893.59 | 1 | < 0.1% |
| 23503.31 | 1 | < 0.1% |
| 24043.45 | 1 | < 0.1% |
| 27288.43 | 1 | < 0.1% |
| 27517.15 | 1 | < 0.1% |
| 27755.97 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 250898.09 | 1 | |
| 238387.56 | 1 | |
| 222267.63 | 1 | |
| 221532.8 | 1 | |
| 216109.88 | 1 | |
| 214346.96 | 1 | |
| 213146.2 | 1 | |
| 212778.2 | 1 | |
| 212696.32 | 1 | |
| 212692.97 | 1 |
NumOfProducts
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 78.3 KiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 3 |
| 4th row | 2 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 5084 | |
| 2 | 4590 | |
| 3 | 266 | 2.7% |
| 4 | 60 | 0.6% |
Length
Histogram of lengths of the category
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 5084 | |
| 2 | 4590 | |
| 3 | 266 | 2.7% |
| 4 | 60 | 0.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 5084 | |
| 2 | 4590 | |
| 3 | 266 | 2.7% |
| 4 | 60 | 0.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 10000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 1 | 5084 | |
| 2 | 4590 | |
| 3 | 266 | 2.7% |
| 4 | 60 | 0.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 10000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 1 | 5084 | |
| 2 | 4590 | |
| 3 | 266 | 2.7% |
| 4 | 60 | 0.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 10000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 1 | 5084 | |
| 2 | 4590 | |
| 3 | 266 | 2.7% |
| 4 | 60 | 0.6% |
HasCrCard
Categorical
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 0 |
| 3rd row | 1 |
| 4th row | 0 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 7055 | |
| 0 | 2945 |
Length
Histogram of lengths of the category
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 7055 | |
| 0 | 2945 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 7055 | |
| 0 | 2945 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 10000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 1 | 7055 | |
| 0 | 2945 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 10000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 1 | 7055 | |
| 0 | 2945 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 10000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 1 | 7055 | |
| 0 | 2945 |
IsActiveMember
Categorical
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 5151 | |
| 0 | 4849 |
Length
Histogram of lengths of the category
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 5151 | |
| 0 | 4849 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 5151 | |
| 0 | 4849 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 10000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 1 | 5151 | |
| 0 | 4849 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 10000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 1 | 5151 | |
| 0 | 4849 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 10000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 1 | 5151 | |
| 0 | 4849 |
EstimatedSalary
Real number (ℝ)
| Distinct | 9999 |
|---|---|
| Distinct (%) | > 99.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 100090.24 |
| Minimum | 11.58 |
|---|---|
| Maximum | 199992.48 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.3 KiB |
Quantile statistics
| Minimum | 11.58 |
|---|---|
| 5-th percentile | 9851.8185 |
| Q1 | 51002.11 |
| median | 100193.91 |
| Q3 | 149388.25 |
| 95-th percentile | 190155.38 |
| Maximum | 199992.48 |
| Range | 199980.9 |
| Interquartile range (IQR) | 98386.137 |
Descriptive statistics
| Standard deviation | 57510.493 |
|---|---|
| Coefficient of variation (CV) | 0.57458642 |
| Kurtosis | -1.1815184 |
| Mean | 100090.24 |
| Median Absolute Deviation (MAD) | 49198.15 |
| Skewness | 0.0020853577 |
| Sum | 1.0009024 × 10⁹ |
| Variance | 3.3074568 × 10⁹ |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 24924.92 | 2 | < 0.1% |
| 140469.38 | 1 | < 0.1% |
| 51695.41 | 1 | < 0.1% |
| 151325.24 | 1 | < 0.1% |
| 64327.26 | 1 | < 0.1% |
| 38190.78 | 1 | < 0.1% |
| 101348.88 | 1 | < 0.1% |
| 112542.58 | 1 | < 0.1% |
| 113931.57 | 1 | < 0.1% |
| 93826.63 | 1 | < 0.1% |
| Other values (9989) | 9989 |
| Value | Count | Frequency (%) |
| 11.58 | 1 | |
| 90.07 | 1 | |
| 91.75 | 1 | |
| 96.27 | 1 | |
| 106.67 | 1 | |
| 123.07 | 1 | |
| 142.81 | 1 | |
| 143.34 | 1 | |
| 178.19 | 1 | |
| 216.27 | 1 |
| Value | Count | Frequency (%) |
| 199992.48 | 1 | |
| 199970.74 | 1 | |
| 199953.33 | 1 | |
| 199929.17 | 1 | |
| 199909.32 | 1 | |
| 199862.75 | 1 | |
| 199857.47 | 1 | |
| 199841.32 | 1 | |
| 199808.1 | 1 | |
| 199805.63 | 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 0 |
| 3rd row | 1 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 7963 | |
| 1 | 2037 | 20.4% |
Length
Histogram of lengths of the category
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 7963 | |
| 1 | 2037 | 20.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 7963 | |
| 1 | 2037 | 20.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 10000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 7963 | |
| 1 | 2037 | 20.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 10000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 7963 | |
| 1 | 2037 | 20.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 10000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 7963 | |
| 1 | 2037 | 20.4% |
Interactions
Correlations
| | Age | Balance | CreditScore | CustomerId | EstimatedSalary | Exited | Gender | Geography | HasCrCard | IsActiveMember | NumOfProducts | RowNumber | Tenure |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Age | 1.000 | 0.033 | -0.008 | 0.009 | -0.002 | 0.375 | 0.026 | 0.050 | 0.013 | 0.144 | 0.087 | 0.000 | -0.010 |
| Balance | 0.033 | 1.000 | 0.006 | -0.014 | 0.012 | 0.141 | 0.000 | 0.315 | 0.039 | 0.014 | 0.230 | -0.009 | -0.010 |
| CreditScore | -0.008 | 0.006 | 1.000 | 0.006 | 0.001 | 0.086 | 0.000 | 0.018 | 0.000 | 0.025 | 0.017 | 0.005 | 0.001 |
| CustomerId | 0.009 | -0.014 | 0.006 | 1.000 | 0.015 | 0.023 | 0.000 | 0.000 | 0.000 | 0.011 | 0.006 | 0.004 | -0.015 |
| EstimatedSalary | -0.002 | 0.012 | 0.001 | 0.015 | 1.000 | 0.000 | 0.021 | 0.017 | 0.000 | 0.025 | 0.019 | -0.006 | 0.008 |
| Exited | 0.375 | 0.141 | 0.086 | 0.023 | 0.000 | 1.000 | 0.106 | 0.173 | 0.000 | 0.156 | 0.387 | 0.000 | 0.022 |
| Gender | 0.026 | 0.000 | 0.000 | 0.000 | 0.021 | 0.106 | 1.000 | 0.022 | 0.000 | 0.020 | 0.042 | 0.000 | 0.025 |
| Geography | 0.050 | 0.315 | 0.018 | 0.000 | 0.017 | 0.173 | 0.022 | 1.000 | 0.005 | 0.018 | 0.047 | 0.018 | 0.028 |
| HasCrCard | 0.013 | 0.039 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.005 | 1.000 | 0.006 | 0.000 | 0.008 | 0.026 |
| IsActiveMember | 0.144 | 0.014 | 0.025 | 0.011 | 0.025 | 0.156 | 0.020 | 0.018 | 0.006 | 1.000 | 0.038 | 0.000 | 0.021 |
| NumOfProducts | 0.087 | 0.230 | 0.017 | 0.006 | 0.019 | 0.387 | 0.042 | 0.047 | 0.000 | 0.038 | 1.000 | 0.009 | 0.035 |
| RowNumber | 0.000 | -0.009 | 0.005 | 0.004 | -0.006 | 0.000 | 0.000 | 0.018 | 0.008 | 0.000 | 0.009 | 1.000 | -0.007 |
| Tenure | -0.010 | -0.010 | 0.001 | -0.015 | 0.008 | 0.022 | 0.025 | 0.028 | 0.026 | 0.021 | 0.035 | -0.007 | 1.000 |
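One way to read the matrix is to take the Exited row and rank the other variables by the size of their coefficient. The report does not say which coefficient each cell uses, so the values are best treated as a rough measure of association strength rather than as directly comparable numbers. A minimal sketch, with the values copied from the Exited row above:

```python
# Row "Exited" of the correlation matrix (self-correlation omitted).
exited_correlations = {
    "Age": 0.375,
    "Balance": 0.141,
    "CreditScore": 0.086,
    "CustomerId": 0.023,
    "EstimatedSalary": 0.000,
    "Gender": 0.106,
    "Geography": 0.173,
    "HasCrCard": 0.000,
    "IsActiveMember": 0.156,
    "NumOfProducts": 0.387,
    "RowNumber": 0.000,
    "Tenure": 0.022,
}

# Sort by absolute value so that strong negative associations would also surface.
ranked = sorted(exited_correlations.items(), key=lambda item: abs(item[1]), reverse=True)
top_five = ranked[:5]

for name, value in top_five:
    print(f"{name}: {value:.3f}")
```

NumOfProducts and Age stand out, followed at some distance by Geography, IsActiveMember, and Balance. The identifier columns RowNumber and CustomerId show essentially no association with Exited.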
Missing values
A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly spot patterns in data completion.
Sample
| | RowNumber | CustomerId | Surname | CreditScore | Geography | Gender | Age | Tenure | Balance | NumOfProducts | HasCrCard | IsActiveMember | EstimatedSalary | Exited |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1 | 15634602 | Hargrave | 619 | France | Female | 42 | 2 | 0.00 | 1 | 1 | 1 | 101348.88 | 1 |
| 1 | 2 | 15647311 | Hill | 608 | Spain | Female | 41 | 1 | 83807.86 | 1 | 0 | 1 | 112542.58 | 0 |
| 2 | 3 | 15619304 | Onio | 502 | France | Female | 42 | 8 | 159660.80 | 3 | 1 | 0 | 113931.57 | 1 |
| 3 | 4 | 15701354 | Boni | 699 | France | Female | 39 | 1 | 0.00 | 2 | 0 | 0 | 93826.63 | 0 |
| 4 | 5 | 15737888 | Mitchell | 850 | Spain | Female | 43 | 2 | 125510.82 | 1 | 1 | 1 | 79084.10 | 0 |
| 5 | 6 | 15574012 | Chu | 645 | Spain | Male | 44 | 8 | 113755.78 | 2 | 1 | 0 | 149756.71 | 1 |
| 6 | 7 | 15592531 | Bartlett | 822 | France | Male | 50 | 7 | 0.00 | 2 | 1 | 1 | 10062.80 | 0 |
| 7 | 8 | 15656148 | Obinna | 376 | Germany | Female | 29 | 4 | 115046.74 | 4 | 1 | 0 | 119346.88 | 1 |
| 8 | 9 | 15792365 | He | 501 | France | Male | 44 | 4 | 142051.07 | 2 | 0 | 1 | 74940.50 | 0 |
| 9 | 10 | 15592389 | H? | 684 | France | Male | 27 | 2 | 134603.88 | 1 | 1 | 1 | 71725.73 | 0 |
| | RowNumber | CustomerId | Surname | CreditScore | Geography | Gender | Age | Tenure | Balance | NumOfProducts | HasCrCard | IsActiveMember | EstimatedSalary | Exited |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 9990 | 9991 | 15798964 | Nkemakonam | 714 | Germany | Male | 33 | 3 | 35016.60 | 1 | 1 | 0 | 53667.08 | 0 |
| 9991 | 9992 | 15769959 | Ajuluchukwu | 597 | France | Female | 53 | 4 | 88381.21 | 1 | 1 | 0 | 69384.71 | 1 |
| 9992 | 9993 | 15657105 | Chukwualuka | 726 | Spain | Male | 36 | 2 | 0.00 | 1 | 1 | 0 | 195192.40 | 0 |
| 9993 | 9994 | 15569266 | Rahman | 644 | France | Male | 28 | 7 | 155060.41 | 1 | 1 | 0 | 29179.52 | 0 |
| 9994 | 9995 | 15719294 | Wood | 800 | France | Female | 29 | 2 | 0.00 | 2 | 0 | 0 | 167773.55 | 0 |
| 9995 | 9996 | 15606229 | Obijiaku | 771 | France | Male | 39 | 5 | 0.00 | 2 | 1 | 0 | 96270.64 | 0 |
| 9996 | 9997 | 15569892 | Johnstone | 516 | France | Male | 35 | 10 | 57369.61 | 1 | 1 | 1 | 101699.77 | 0 |
| 9997 | 9998 | 15584532 | Liu | 709 | France | Female | 36 | 7 | 0.00 | 1 | 0 | 1 | 42085.58 | 1 |
| 9998 | 9999 | 15682355 | Sabbatini | 772 | Germany | Male | 42 | 3 | 75075.31 | 2 | 1 | 0 | 92888.52 | 1 |
| 9999 | 10000 | 15628319 | Walker | 792 | France | Female | 28 | 4 | 130142.79 | 1 | 1 | 0 | 38190.78 | 0 |